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Abstract 

In this paper, we present an experimental and simulation based study to evaluate the use of full-duplex as 
a mode in practical IEEE 802.11 networks. To enable the study, we designed a 20 MHz multi-antenna OFDM 
full-duplex physical layer and a full-duplex capable MAC protocol which is backward compatible with current 
802.11. Our extensive over-the-air experiments, simulations and analysis demonstrate the following two results. 
First, the use of multiple antennas at the physical layer leads to a higher ergodic throughput than its hardware- 
equivalent multi-antenna half-duplex counterparts, for SNRs above the median SNR encountered in practical WiFi 
deployments. Second, the proposed MAC translates the physical layer rate gain into near doubling of throughput for 
multi-node single- AP networks. The two combined results allow us to conclude that there are potentially significant 
benefits gained from including a full-duplex mode in future WiFi standards. 
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I. Introduction 

Currently deployed wireless communications systems cannot transmit and receive on the same frequency 
band at the same time, i.e., networks do not operate in a single-channel full-duplex fashion. As a result, 
networks are either time-division duplex (e.g., WiFi) or frequency-division duplex (e.g., cellular). The 
key challenge in achieving true full-duplex communication is the large power differential between the 
"self-interference" created by a node's own radio transmission and the signal of interest originating from 
a distant node. The large power differential is simply because the self-interference signal has to travel 
much shorter distances compared to the signal of interest. As a result of the large power differential, 
the signal of interest is swamped by the self-interference in digital baseband due to finite resolution of 
analog-to-digital conversion. 

Full-duplex experimental demonstration for narrowband systems was first reported in 1998 [1]. Since 
then, multiple authors [2-11] have reported different methods and implementations for various single and 
multiple antenna extensions. However, till date none of the prior methods have reported experimental 
evidence to achieve long-enough communication ranges (best reported number in all prior literature is 
8 meters with line-of-sight) for full-duplex to be considered in WiFi-like systems. Our focus in this paper 
is to investigate if a practical WiFi system can leverage full-duplex gains for its typical communication 
range. Our experiment based analysis is the first to investigate the performance of full-duplex systems 
over the entire range of signal to noise ratio (SNR) values typical in WiFi communications. 

In this paper, we present a multi-antenna wideband PHY and MAC design to enable a practical 
full-duplex mode in WiFi. Via extensive over-the-air tests, we show that our design achieves higher 
throughputs than its hardware-equivalent half-duplex MIMO counterparts, for a significant portion of the 
WiFi communication range. Our contributions in design are two-fold. First, to reduce the self-interference, 
the PHY uses a combination of three methods: (i) passive suppression via appropriate placement of multiple 
antennas on a device, (ii) a per-subcarrier per-receive- antenna analog self-interference canceler for MIMO 
OFDM systems and (iii) a digital self-interference canceler implemented in baseband. Second, to gauge 
realistic gains in actual systems, the MAC design leverages legacy WiFi RTS/CTS packets to seamlessly 
support legacy half-duplex and new full-duplex modes. By design, the MAC is minimally different from 
IEEE 802.11 and is designed to leverage the existing 802.11 ecosystem to accelerate potential adoption 
of the full-duplex mode in future 802.11 revisions. 

En route to showing that our design provides rate gains for WiFi networks, we perform extensive 
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statistical characterization of the design elements, revealing several new findings. Our findings can be 
divided into three categories: (i) self-interference canceler performance in full-duplex, (ii) the comparison 
of empirical ergodic rates achieved by full- and half-duplex systems, and (iii) an extensive MAC layer 
performance analysis for different traffic scenarios. 

Results on self-interference cancellation in full-duplex: Recall that we are employing three mecha- 
nisms to reduce self-interference - passive suppression by antenna placement, and two active cancelers 
- one in analog and the other in digital baseband. The three mechanisms are concatenated serially 
to result in a three-stage design. The serial concatenation implies that each stage is operating on the 
residual signal of the previous stage. As a result, the performance of each stage is not independent of the 
performance of the stages prior to it. In general, if a stage cancels more of the self-interference, then the 
subsequent stages cancel less. Thus, in general, our results show that the total self-interference canceled 
by any two concatenated stages is not the sum of maximum self-interference canceled by each stage 
individually in isolation. The non-additive nature of concatenated cancellation techniques also demonstrates 
the challenge of completely suppressing self-interference - individually improving each stage does not 
guarantee equivalently better performance in the total system performance. 

Digging deeper to understand the role and interaction of each cancellation stage, we show following 
four results experimentally for different 20 MHz 64-subcarrier OFDM physical layers. First, we consider 
antenna placement for 2x 1 MISO full-duplex, where each node has three antennas - two transmit and one 
receive. By placing antennas around the device to use the device itself to attenuate self-interference and 
also leveraging antenna polarization, self-interference can be suppressed by an additional 15 dB compared 
to the configuration where there is no device. Thus, the key message is that placement of antennas is 
crucial in full-duplex devices. We note that our antenna placement aims to only increase the pathloss of 
self-interference and thus is highly robust to device size variations. In contrast, prior antenna placement 
techniques aim to create beamforming nulls [5, 10, 11], which are designed under the assumption that the 
self-interference channel does not have multi-path components. As a result, beam-forming based designs 
in [5, 10, 11] require self-interfering antennas to be either symmetrically spaced or placed at a distance 
which is a function of the frequency of operation. 

Second, passive device based suppression largely reduces the direct line-of-sight path for self-interference 
and thus the multi-path reflections become dominant. This becomes evident by the fact that with more pas- 
sive cancellation, the self-interference channel becomes more frequency-selective. The measured frequency- 
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selectivity was our motivation behind per-subcarrier analog canceler, which actively cancels self-interference 
in each OFDM band with sub-band specific cancellation coefficients. 

Third, we measure the performance of each cancellation stage and also consider its impact on subsequent 
stages. The measured results clearly show the above mentioned fact - more cancellation by one stage means 
lower cancellation possible in later stages. In [5], it was assumed that the performance of cancellation 
stages is additive. This assumption was then used to obtain an estimate of 73 dB of total analog plus 
digital cancellation. However, the prototype implemented in [5] achieved 30 dB of analog plus digital 
cancellation, which is 43 dB less than their estimated maximum of 73 dB. Hence, the total cancellation 
of concatenated cancellation stages did not equal the sum of the cancellation achieved by each stage 
individually in isolation. Thus, we believe that our conclusions are qualitatively typical for any hardware 
implementation, which uses serial concatenation of different cancellation schemes. 

Lastly, we combine all three methods of cancellation (passive, analog, and digital) and demonstrate that 
our three-stage self-cancellation system achieves a median cancellation of 85 dB, with minimum of 70 dB 
and a maximum of 100 dB. The median and maximum numbers for total self-interference cancellation 
are the best reported numbers in the literature till date. We note the importance of studying the statistical 
properties of the cancelers. All cancellation mechanisms rely on some form of channel estimation to adjust 
its cancellation weights and thus have to deal with noise introduced by RF and baseband (e.g. in the form 
of quantization noise) stages. Thus, no cancellation mechanism can be guaranteed to achieve a constant 
cancellation in all cases, and will exhibit statistical variations. 

Results on Ergodic rate Comparisons: We implemented two full-duplex physical layers -2x1 MISO 
and lxl SISO, and three half-duplex systems - 2x1 MISO, 3x1 MISO and 2x2 MIMO. The RF 
hardware usage of the five systems is compared by counting the total number of RF up-conversion and 
down-conversion chains. A 2x2 MIMO half-duplex uses 2 up-converting and 2 down-converting chains 
for a total of 4 chains. A 2x 1 MISO full-duplex uses 3 up-converting and 1 down-converting chains, again 
for a total of 4 chains. Similarly all other configurations mentioned above use 4 or fewer total chains. 
The main motivation for using RF hardware equivalence is that in most portable devices, the power 
consumption of RF is a key factor and thus often determines the largest supported antenna configurations. 

We perform extensive experiments which allow us to compare the performance of full-duplex and 
half-duplex systems for SNR values from to 40 dB. In WiFi systems, the received signal of interest 
power is typically between —80 dBm and —60 dBm, and the noise floor is around —90 dBm. Hence, the 
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range of operation for WiFi systems corresponds to SNR values lower than 30dB. We observe that for a 
significant fraction of the WiFi SNR range of operation (more specifically, SNR values greater than 20 
dB), 2x1 full-duplex can often outperform the rest of the four configurations. In terms of multiplexing 
gain, 2x1 full-duplex and 2x2 half-duplex should have the same multiplexing gain of two. However, the 
measured multiplexing gain of 2x2 half-duplex is often less than two, and here again 2x1 full-duplex 
achieves a higher measured multiplexing gain. While surprising at first, the result is easily explained by 
the distribution of condition numbers of channel matrices observed in our extensive indoor tests, 1 and 
match the results for half-duplex MIMO systems observed in other experiments [12]. 

Results on MAC Layer: Recall that a primary design objective for the MAC was to make minimal 
changes to legacy 802.11 MAC to extract the advantages of full-duplex technology. The MAC design 
supports both legacy half-duplex and full-duplex flows without hurting the throughput for half-duplex 
nodes significantly. For the full-duplex flows, asymmetric packet sizes are also supported since the packet 
sizes in the two directions of a full-duplex transfer may be different. 

In 802.11, if a node cannot decode a frame successfully, it triggers a longer wait time due to the 
use of EIFS (Extended Inter Frame Spacing). In the full-duplex mode, the nodes other than the two 
nodes participating in full-duplex exchange, do not decode frames correctly. Thus the MAC design has 
modification to avoid waiting for EIFS in certain scenarios. The new MAC design is simulated in an 
OPNET based MAC simulator, which allowed us to use an industry- standard WiFi implementation and 
stay backward compatible with IEEE 802.11 MAC protocol. We focused on the full-buffer scenario to 
determine maximum throughput, and also examined fairness between the full-duplex and half-duplex 
nodes. 

Our results were obtained in four major steps. First, we evaluated the performance of a single AP 
communicating with one full-duplex flow. Full-Duplex MAC doubles the throughput of the system as 
compared to a legacy WiFi half-duplex communication using RTS/CTS signaling for a fixed total cancel- 
lation of 85 dB, propagation loss of 63dB, and symmetric traffic. Full-duplex MAC throughput increases 
by 87% as compared to legacy WiFi half-duplex system that does not use RTS/CTS signalling for packet 
size of 1500 bytes. We further investigated asymmetric packet sizes, where uplink data packet size can 
be different from downlink packet size for a full-duplex exchange. Keeping in mind that typical data 
communications uses TCP as the transport layer protocol, in which a one-way transfer of data would 

'We performed only indoor tests since most WiFi deployments are indoors. 
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typically have 1500 byte data packets with 40 byte acknowledgment in the reverse direction, we quantify 
the goodput performance for varying packet sizes. As the degree of asymmetry reduces, the throughput 
gains ranged from 30%-100% as compared to a legacy half-duplex system with RTS/CTS, and 18%— 87% 
as compared to a legacy half-duplex system without RTS/CTS. 

Second, we considered scaling of full-duplex system as one AP communicates with more full-duplex 
nodes. We first note that for a half-duplex system with four or more nodes, the use of RTS/CTS improves 
the goodput since data collisions (which trigger retransmission of the large data packets) are replaced by 
collisions of the short RTS frames. We find that the sum throughput for a full-duplex system increases 
by a factor of at-least two, when compared to a half-duplex system with RTS/CTS with the same number 
of nodes. 

Third, we consider the system performance when full-duplex nodes co-exist with half-duplex nodes, 
where the half-duplex nodes' MAC logic has been slightly modified to ignore collisions during a full- 
duplex transfer. This provides an insight into the dynamics of co-existence. This modification in half-duplex 
can be made by a shift in logic of timing without the need for new hardware and may or may not be 
pragmatic. We find that for a system with m full-duplex and m half-duplex nodes, the total throughput 
compared to a half-duplex-only system increases by a factor of 1 + m/(2m+ 1). The percentage increase 
in throughput for a mixed system as compared to a half-duplex system increases with m such that the 
maximum percentage increase can be up to 50%. The uplink and downlink throughputs of the full-duplex 
nodes in a mixed system are higher as compared to the uplink and downlink throughputs of a node in 
a system with 2m half-duplex nodes and no full-duplex node respectively. Thus, the improved hardware 
for full-duplex nodes provides a substantial improvement to the throughput of the full-duplex nodes. In 
addition, half-duplex nodes also achieve higher throughputs. The downlink throughput from AP to half- 
duplex nodes almost doubles when there is a mix of full- and half-duplex nodes, and the uplink throughput 
from half-duplex node to AP is also improved slightly as compared to the corresponding throughput in a 
purely half-duplex system. 

Finally, we consider the coexistence with legacy half-duplex nodes that have no modifications. Much 
like the above discussion, the uplink and downlink throughputs of full- duplex nodes increases, and so does 
the downlink throughput to HD nodes as compared to a purely half-duplex system. However, half-duplex 
nodes do not grab the channel as often as they would in a purely half-duplex system (where the total 
number of nodes in two cases are the same) leading to a decrease in their uplink throughput by around 
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40% for m = 2 as compared to purely half-duplex system. To increase probability of access, we change 
the proposed full-duplex MAC design to make it better throughput fair with the legacy half-duplex nodes. 
The above change in full-duplex MAC decreases throughput by around 2% for the full-duplex nodes (as 
compared to the case of modified half-duplex and unchanged full-duplex nodes) by making them less 
aggressive in lieu of increased probability of access for legacy half-duplex nodes which is almost the 
same as if all the nodes were half-duplex. 

The rest of the paper is organized as follows. In Section II, we describe the MIMO wideband canceller 
design which uses a combination of passive suppression and active cancellation techniques. In Section III, 
we describe the experimental setup for validating the design. Section IV and V evaluates the cancellation 
design in terms of cancellation and throughput respectively. In Section VI, we give our MAC design with 
detailed evaluations in Section VII. Section VIII concludes this paper. 

II. MIMO Wideband Canceller design 

We present a design for a wideband multiple antenna self-interference canceller which uses a combina- 
tion of passive suppression and active cancellation techniques, where passive suppression precedes active 
cancellation. The cancellation techniques are explained below. 

Passive Suppression (PS): Passive suppression is achieved by maximizing the attenuation of the self- 
interference signal due to propagation path loss over the self-interference channel, which is the channel 
between same node transmitter and receiver antennas. The amount of passive suppression depends on the 
distance between antennas, the antenna directionality, and the antenna placement on the full-duplex device. 
We use hj iJnjn to denote the self-interference channel between transmitter antenna m and receiver antenna 
n at node i. The self-interference channel, hj )m>n , varies with time and frequency due to changes in the 
node's environment. Our design of self-interference cancellation for OFDM systems will be presented 
in the frequency domain. We use h i:m:n [k] to denote the magnitude and phase that the self-interference 
channel hj m n applies to subcarrier k. For a system with K subcarriers the channel vector is defined as 

\m,n = [hi, m , n [l],K,m,n[ 2 ]>'''> h i,m,n[ K ]]- Fl g ure 1 snows tne two passive cancellation paths h iiM and 
hj,2,i for a full-duplex node with two transmitter antennas and one receiver antenna. 

Active Analog Cancellation (AC): As the name suggests, the active cancellation is performed in 
analog domain before the received signal passes through the Analog-to-Digital Converter (ADC). For 
an OFDM MIMO node, the self-interference signal received at Node i antenna n on subcarrier k after 
passive suppression is equal to y(£[k] = J2m=i hi,m,n[k]xi, m [k], where x itm [k] is the signal transmitted 
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from Node % on subcarrier k antenna m. Analog cancellation of the self-interference at receiver antenna 
n is implemented by subtracting an estimate of yf^[k\ from the received signal. 

In our proposed MIMO wideband canceller design, the additional hardware components required for 
active analog cancellation of the self-interference at one receiver antenna consist of one Digital-to-Analog 
converter (DAC), one up-converting radio chain (Tx Radio) which up converts the signal from Base Band 
(BB) to Radio Frequency (RF), one fixed attenuator, and one RF adder. Figure 1 shows a diagram of 
our proposed analog cancellation for a full-duplex node with two transmitter antennas and one receiver 
antenna. One input to the RF adder is the signal at the receiver antenna, and the other input is a canceling 
signal z i:fl local to node % which is input to the RF adder via a wire. For subcarrier k and receiver 
antenna n, the local signal z ij7l is equal to z ijn [k] = -h^ n [k] J2m=i h,m,n[ k \ x hm[ k ], where h^ n [k] denotes 
the magnitude and phase that affect a signal at subcarrier k when passing through the wire connected to 
the RF adder at node % receiver antenna n. Further, &j, m ,„[&;] denotes the cancellation coefficient for the 
self-interference received at antenna n from transmitter antenna m at subcarrier k at Node i. 

The self-interference at subcarrier k after analog cancellation at antenna n (this is the signal at the 
output of the RF adder connected to antenna n) is equal to yf£[k}= yfn[k] — z iyn [k], which can be 
rewritten as y$?[k]= J2m=i(Km,n[ k \ ~ h ™ n [ k \h™A k \) x i,rn[ k \- From the equation for y^[k\, we observe 
that active analog cancellation achieves perfect cancellation when bi >m , jn [k] = hi im>n [k]/h^ n [k]. In a real 
system, h ijmjn [k] and h^ n [k] can only be estimated, which leads to the following computation of 

bi, m ,n [k] = hi,m, n [k] / h™ n [ k ], ( 1 ) 

where hi jm , n [k] and h^ n [k] are the estimates of h ijmjn [k] and h^ n [k] respectively. Thus, cancellation is 
usually not perfect. The estimates of hi jm , jn [k] and ti^ n [k] are computed based on pilots sent from each 
transmitter radio on orthogonal time slots. 

In a WiFi system that uses RTS/CTS, the estimates of h iimin [k] and h^ n [k\ can be computed based on 
pilots sent during the RTS/CTS transmissions. Further, since h^ n [k] is a wire, it is a static channel and it 
does not need to be estimated often. While the RTS/CTS packet exchange adds overhead to the system, 
it enables full-duplex and results in overall rate gains as will be shown in Sections VI and VII. 

We note that any additional transmitter radio used for analog cancellation does not require a power 
amplifier since it is transmitting over a wire. However, for our specific implementation, the radio used 
for analog cancellation had a power amplifier which could not be removed. Hence, we used a fixed RF 



9 

attenuator connected in series, as shown in Figure 1, in order to reduce the signal power levels at the output 
of the canceller radio to the levels required for cancellation. The attenuator used was a passive device (part 
number PE7001 [13]) that attenuates all the frequencies in the band of interest by the same amount. The 
value set for the attenuator was a function of the antenna configuration used because different antenna 
configurations resulted in different levels of self-interference power at the receiver antenna; different 
antenna configurations have different amount of passive suppression as will be shown in Section IV-B. 
The four antenna configurations used are shown in Table I and will be explained in more detail in 
Section III-C. The attenuator was set equal to 35 dB for Antenna Placement 1 without device, 45 dB 
for Antenna Placement 1 with device, 50 dB for Antenna Placement 2 without device, and 55 dB for 
Antenna Placement 2 with device. The RF attenuator would not have been needed if the radio used for 
analog cancellation had a larger range of output powers and did not use a power amplifier by default. 

We highlight that the RF adder used for analog cancellation is a passive device (part number PE2014 [14]) 
and applies the same addition operation to all the frequencies in the band of interest. 

Digital Cancellation (DC): There is a residual self-interference yf^ [k] that remains after analog 
cancellation due to imperfect analog cancellation. Active digital cancellation estimates yf£[k\ and sub- 
tracts this estimate from the received signal in the digital domain. The estimate of yfn[k] * s computed 
based on a second round of pilots sent from each transmitter antenna and received while applying 
analog cancellation to each receiver antenna. Specifically, the second round of pilots is used to compute 
hi : m : n[k] — h^ n [k\bi^ n [k\. Alternatively, the estimate of yf£ [k] can be computed without extra pilots if 
implemented based on correlation between the transmitted and received self-interference payload signal. 

III. PHY Experiment Description 

In this section, we describe our experiment testbed, antenna configurations and physical layer techniques 
which will be compared and their implementation details on WARP [15]. 

A. Node Locations 

We used five nodes, labeled as nodes Na, Nb, Nc, Nd, and Ne. The nodes were placed at locations 
shown in Figure 2. Nodes Na, Nb, Nc, Nd, and Ne were located at a height of 1.5 m, 1.5 m, 1.4 m, 1.7 m 
and 2.0 m respectively, above the floor. Experiments were conducted in the second floor of a three-floor 
office building and were performed both at night and during office work hours with people walking in 
and out of the rooms. The five-node setup allowed us to evaluate ten different two-node links. The ten 
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link pairs, their inter-node distance and the type of channel for each link are shown in Table II. Our 
choices allowed us to create line-of-sight channels and also extremely challenging multi-wall propagation 
environments, which represented a typical Wi-Fi deployment. In contrast, the experiment setup in [11] 
was located at least 20 m from the the nearest wall hence, which does not capture some typical WiFi 
scenarios. For experiment results in [5, 7,11] the distance between communicating nodes was not reported. 

B. Full-duplex and Half-duplex Modes 

For each of the ten links, we ran experiments for the following physical layers: full-duplex lxl (FD1 x 1), 
full-duplex 2x1 (FD2xl), half-duplex 2x1 (HD2xl), half-duplex 3x1 (HD3xl), and half-duplex 2x2 
(HD2x2). Experiment results obtained for the above five systems have the necessary data to evaluate the 
performance of our full-duplex design and compare its performance with half-duplex systems which use 
the same or less radio resources per node. Notice that an HDM x N node needs M up-converting radio 
chains and N down-converting radio chains for a total of M + N radio chains. In contrast, our proposed 
FDM' x N' node uses M' up-converting radio chains for transmission, N' down-converting radio chains 
and N' up-converting radio chains for self-interference cancellation for a total M' + 2N' radio chains per 
node for any M', N' > 1. 

For all five PHY configurations listed above, the total number of chains is no more than 4. That is 
M + N < 4 for half-duplex systems and M' + 2N' < 4 for full-duplex systems. Table III shows the number 
of radios and antennas per node used by each of the full-duplex and half-duplex systems considered. We 
will compare the performance of full-duplex and half-duplex systems which use the same number of 
radios per node. The performance of FD2xl will be compared with the performance of HD3xl and 
HD2x2 systems. The performance of FD1 x 1 will be compared with the performance of HD2x 1. 

For the experiments with more than one transmitter antenna, the multiple antenna codes used were the 
following. For the FD2x 1 experiments we used an Alamouti code [16]. Hence, in Figure 1, the signals x itl 
and x i: 2 correspond to Alamouti encoded symbols. The HD2x 1 experiments also used an Alamouti code. 
The HD3 x 1 experiments used a rate 3/4 orthogonal space-time block code (OSTBC) from MATLAB 
MIMO library [17]. The HD2x2 experiments used spatial multiplexing for two spatial streams and the 
receive processing was implemented using channel inversion. 

We note that a FD2x 1 Alamouti implementation using our proposed wideband MIMO canceller requires 
three antennas per node which is less than what is required by the MIMO cancellation techniques proposed 
in [11]. The MIMO antenna cancellation technique in [11] would require at least four antennas per node. 
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The transmitter/receiver antenna cancellation technique proposed in [11] requires 6 antennas per node for 
implementation of a FD2xl Alamouti system. 

C. Multi-antenna Placements 

We considered two possible antenna placements for the full-duplex and half-duplex experiments. For 
each antenna placement we considered two cases: antennas with a device (a 15-inch Macbook Pro laptop) 
and without a device. Hence, we considered a total of four different configurations as shown in Table I. 
For all the configurations, Rl was used as the receive antenna for all the systems that used only one 
receiver antenna, i.e FDlxl, FD2xl, HD2xl, and HD3xl. For HD2x2, all the configurations used 
Rl and R2 as receiver antennas. For all the configurations and systems evaluated, if M antennas were 
required for transmission, we used antennas Tl to TM. 

The antennas used in experiments [18] are designed for 2.4 GHz operation, with vertical polarization, and 
have toroid-like radiation pattern shown in [18]. In Antenna Placement 1 (Al), the full-duplex experiments 
correspond to the case where the main lobe of the receiver antenna (Rl) is in the same direction as the 
main lobe of Tl and orthogonal to the main lobe of T2. In Antenna Placement 2 (A2), the full-duplex 
experiments correspond to the case where the receiver (Rl) main lobe is orthogonal to the main lobe 
of both Tl and T2. As experiments will demonstrate, the orthogonal placement of the transmitter and 
receiver main lobes in A2 will help reduce the self-interference. Hence A2 will result in larger passive 
suppression than Al. Experiment results in Section IV will also demonstrate and quantify the increase in 
passive suppression achieved by placing antennas appropriately around a device. 

D. Transmit Power Normalization 

For a fair comparison between full-duplex and half-duplex systems, the total energy transmitted by a 
full-duplex node must be the same as the total energy transmitted by a half-duplex node. Since energy is 
power times transmission time, the equation 

pFDrpFD _ pHDrpHD ^) 

% % % % ' 

defines the relationship between full- and half-duplex powers, where Pf D denotes the transmission power 
use by Node % in full-duplex mode, P^ D denotes the transmission power used by node % in half-duplex 
mode, Tf D denotes the duration of a transmission from node % in full-duplex mode, and T t HD denotes 
the duration of a transmission from node % in half-duplex mode. 
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Consider a finite duration, r, of time for bi-directional communication between Nodes 1 and 2. From 
time constraints for full-duplex and half-duplex we have that T[ D = T( D = r and T^ D + T 2 HD = r. 
We define (5 = Tf 10 jr. Using Eq. (2), the definition of (5, and the time constraints, we obtain that for 
a fair comparison between full-duplex and half duplex systems the node powers used in full-duplex and 
half-duplex must satisfy 

pFD = pHDp (3) 

P 2 FD = Pf D (l-/3). (4) 

Notice that Equations (3) and (4) do not impose any constraint on the maximum power assigned to 
a node. However, in real systems, the maximum instantaneous radiated power is limited and is typically 
defined in standards. Hence, in order to include practical considerations in our power assignment equations, 
we define n as the maximum power that can be radiated by the network (not just one node, but all the 
nodes in the network together) at any time. Since half-duplex transmissions from each node are orthogonal 
in time, it implies that in a network with two nodes % — 1, 2 the transmission powers must be such that 
P\ D < n and P 2 HD < n. In contrast, since full-duplex transmissions from each node are simultaneous, 
the instantaneous radiated power constraint of II translates to a power constraint of P[ D + P 2 FD < II for 
full-duplex nodes. Thus, we ensure that at any given time, a network with full-duplex nodes radiates the 
same power that would be radiated by a network with half-duplex nodes. 

All our experiments correspond to an instantaneous power constraint of II = 8 dBm and we achieve this 
constraint with equality. Hence, our experiments correspond to the following transmit power assignments: 
pHD = P HD = g dBm p fd = g dBm+101og 10 (/3), and P[ D = 8 dBm+101og 10 (l-/3). We performed 
only symmetric experiments, where j3 = 0.5, leading to Pf D = P 2 FD = 5 dBm. 

The radios used in our experiments can transmit at a maximum power of 25 dBm. However, we observed 
that the radio's transmitter power versus gain setting relation is linear only for output powers between 
dBm and 15 dBm for OFDM signals of 20 MHz bandwidth used in our experiments. Consequently, for 
our experiments we chose transmission powers which lie close to the middle of the linear range of the 
transmitter radios. Accounting for amplifier nonlinearities and their impact on cancellation coefficients, 
hmn, will be focus of future work. 
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E. WARP Implementation and Testbed Setup 

The digital and analog signal processing at a node were implemented using the WARPLab frame- 
work [15]. The WARPLab framework facilitates experiment implementation by allowing the use of 
MATLAB for digital signal processing and the use of WARP [15] hardware for real-time over-the-air 
transmission and reception. 

All full-duplex and half-duplex experiments were conducted at a 2.4 GHz Wi-Fi channel without any 
other concurrent traffic. In all our experiments the nodes shared the same carrier frequency reference 
clock. All systems implemented have a bandwidth of 20 MHz using 64 subcarriers with 48 subcarriers 
used for payload as specified in one of the possible Wi-Fi modes. 

For each of the ten links considered, we ran experiments with both nodes using the same antenna/device 
configuration, and we considered all the possible combinations for the ten different links and four possible 
configurations shown in Table I. Thus there were a total of 40 different scenarios. For each scenario and 
full-duplex/half-duplex system considered, an experiment consisted of transmitting 90 packets from each 
of the nodes in the link. Each packet transmitted consisted of 68 OFDM symbols (the number of OFDM 
symbols per packet was limited by buffer sizes in the WARPLab framework) and each subcarrier was 
modulated using QPSK. Since there were 48 payload subcarriers per OFDM symbol, the total number of 
bits transmitted per packet per node was equal to 6528 and the total number of bits transmitted per node 
in 90 packets was equal to 587,520. 

IV. PHY Evaluation: Canceller Performance 

In this section we characterize the performance of the self-interference cancellation stages. We demon- 
strate that our full-duplex design can achieve self-interference cancellation values, which can be larger 
than what has been reported in prior work. 

A. Metric for Canceller Analysis 

We measured the self-interference power after each stage of cancellation for each packet transmitted 
by a node in full-duplex mode. For each stage of cancellation, the amount of cancellation (in dB) was 
computed as the difference between the self-interference power before cancellation and the self-interference 
power after cancellation. The measurement of the self-interference power after each cancellation stage 
was computed based on the RSSI reading provided by the WARP radios. A more detailed explanation of 
the power measurements is provided in [19]. 
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B. Performance of Passive Suppression 

Result 1 (Gain from Antenna Placement and Orientation): The amount of passive suppression increases, 
by as much as 15 dB, for the placement where (a) the receiver antenna is placed orthogonal to the 
transmitter antennas responsible for self-interference and (b) the device-induced pathloss is increased. 

Figure 3(a) shows a characterization of the amount of passive suppression achieved by the four different 
configurations listed in Table I. First, we observe that at a CDF value of 0.5, configuration Al with device 
achieves approximately 10 dB better cancellation than Al without device. Similarly, at a CDF value of 
0.5, configuration A2 with device is observed to achieve approximately 10 dB better cancellation than 
A2 without device. Hence, we conclude that placing antennas around a device improves the passive 
suppression by approximately 10 dB. 

Second, we observe that at a CDF value of 0.5, configuration A2 with device achieves approximately 
5 dB better cancellation than Al with device. Similarly, A2 without device achieves approximately 5 dB 
better cancellation than Al without device. Hence, we conclude that antenna placement A2 improves the 
passive suppression by approximately 5 dB with respect to antenna placement Al. The reason for this 
improvement is due to the fact that in A2 the receiver antenna main lobe is placed orthogonal with respect 
to the transmitter antennas main lobe. Consequently, A2 results in less coupling between self-interfering 
antennas and this results in larger levels of passive suppression. 

Recent characterizations of passive suppression mechanisms [5, 6] demonstrate levels of passive sup- 
pression lower than 60 dB. Our results in Figure 3(a) show that taking into account the antenna pattern 
and placing the antennas around the full-duplex device serves as further means of passive suppression and 
helps achieve passive suppression values between 60 dB and 70 dB. Comparing the cancellation values 
for Al without device and A2 with device in Figure 3(a), we observe that through device cancellation 
and orthogonal antenna placement improve the amount of passive suppression by approximately 15 dB. 

Result 2 (Impact of Passive Suppression on Self-interference Channel): As the amount of passive sup- 
pression increases, the wireless self-interference channel becomes more frequency selective. 

In our implementation of the analog canceler we compute the cancellation coefficient per subcarrier, 
bi,m,n[k], as shown in Eq. (1). Hence, &j, m , n [A;] is the ratio of the estimate of the self-interference wireless 
channel h ijTnjn [k] and the wire channel h^ n [k]. Since the wire channel h^ n [k] is frequency flat, variations 
of the cancellation coefficient b itm>n [k\ as a function of the subcarrier index will be due to variations of 
the self-interference channel h iimjTt as a function of frequency. If h.^ m ,n is frequency flat then 6j, m , n [A;] 
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will be the same across all subcarriers. If h ijmj „ is frequency selective then bi jm , jn [k] will vary for different 
subcarriers. 

Figure 4(a) shows the magnitude of the cancellation coefficients, 6i, m , n [fc], for each of the 48 data 
subcarriers captured for two subsequent packets. The subcarrier spacing is 0.3125 MHz as in 802.11 for 
a 20 MHz bandwidth channel. We observe that as a function of subcarriers, the channel attenuation can 
vary significantly across frequency, and thus approximating self-interference channel as frequency flat can 
be highly inaccurate. 

To completely characterize the statistical variations in self-interference channel across frequency, we 
use the measure of peak-to-peak (p2p) value of the magnitude of the cancellation coefficient, \b i:m , n \ p2p , 
as follows, 
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If the self-interference channel hj m n is a flat frequency channel then \b iiTnin \ p2p = 1 and for a frequency 
selective channel \b i:fn:n \ p2p will be larger than 1. For each FD2xl experiment we computed the value 
of \bi ; m,n\ p2p between transmitter antenna 1 (Tl) and receiver antenna 1 (Rl). Figure 4(b) shows a 
characterization of the cancellation coefficient for the four different antenna configurations listed in Table I. 
Figure 4(b) shows that the channel can have large variations in magnitude in the practical case of antennas 
placed around a device, with a median of 9 dB p2p magnitude variations for A2 with device. 

Comparing Figure 4(b) with Figure 3(a), we observe the following. The larger the passive suppression, 
the larger are the variations of the self-interference channel as a function of frequency. Intuitively this 
makes sense since passive suppression of the self-interference corresponds to suppression of the strongest 
line-of-sight paths between self-interfering antennas. As the line-of-sight path is weakened, the self- 
interference channel becomes more dependent on weaker reflected multi-paths and this results in larger 
frequency selectivity of the self-interference channel. 

For scenarios where the channel is frequency-selective, the active analog cancellation must be able to 
adapt to the frequency variations of the channel per subcarrier, as is the case in our proposed implemen- 
tation of active analog cancellation. 

C. Performance of Analog Cancellation 

To better illustrate the importance of the per subcarrier adaptation of the analog canceler, we compare 
the performance of our per subcarrier analog cancellation with the performance of two analog cancellation 
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schemes that do not adapt the magnitude of the cancellation coefficient per subcarrier and use the same 
magnitude of the cancellation coefficient for all subcarriers (as is the case for the analog canceler schemes 
considered in [4, 5, 7]). 

Specifically, we consider the following two flat-frequency cancelers - (i) Flat-Frequency Canceller 1 
(FFC1): for this canceler the magnitude of the cancellation coefficient is the same for all subcarriers 
and is computed as the average from the required per subcarrier as (1/K) J2k=i \bi,m,n[k]\* an d (h) Flat- 
Frequency Canceller 2 (FFC2): for this canceler the magnitude of the cancellation coefficient is the same 
for all subcarriers and is computed as the value required by the middle subcarrier in the band hence it is 
equal to |6j imjn [-ft'/2]|. We highlight that the three analog cancelers, per- subcarrier, FFC1 and FFC2, are 
different only in the magnitude of the cancellation coefficient but have the same per subcarrier adaptation 
of the phase of the cancellation coefficient. The above simplification made our implementation easier for 
comparison while still allowing us to demonstrate the importance of per subcarrier adaptation. 

Figure 3(b) shows the amount of active analog cancellation that our proposed analog cancellation 
achieves for configurations Al without device and A2 with device and it also shows the performance of 
FFC1 and FFC2. We observe that per subcarrier adaptation of the magnitude of the cancellation coefficient 
achieves larger analog cancellation than FFC1 and FFC2. From the Figure 3(b) we approximate that per 
subcarrier adaptation of the magnitude of the cancellation coefficient achieves approximately 5 dB larger 
cancellation than FFC1 and FFC2. Hence, we obtain the following result. 

Result 3 (Gains from Per-subcarrier Cancellation): Per subcarrier analog cancellation improves the 
amount of analog cancellation, by approximately 5 dB, compared to cancelers which do not adjust the 
magnitude of the cancellation coefficient per subcarrier. 

From Figure 3(b) we observe that the analog cancellation was larger for the configuration without 
device compared to the configuration with device. Hence, the roles for best/worst cancellation are inverted 
with respect to what we had observed in Figure 3(a), where configurations with device showed better 
performance than configurations without device. The reason why the configuration with device achieves 
lower levels of analog cancellation is because analog cancellation is based on an estimate of the self- 
interference channel. The weaker the received self-interference (self-interference at the receiver antenna), 
the worse is the estimate of the self-interfering channel and the worse is the amount of analog cancellation 
achieved. Configurations with device have the weakest levels of received self-interference because they 
achieve the largest passive suppression. Hence, we have the following result. 
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Result 4 (Passive impacts Analog): As the amount of passive suppression increases, the amount of 
analog cancellation decreases. 

The reasoning for Result 4 was also noted in the simulation based analysis presented in [20] and the 
experiment based analysis of a narrowband canceller presented in [21]. This paper extends our prior 
narrowband single-antenna result to wideband multiple antenna systems. 

D. Performance of Digital Cancellation 

We are now interested in characterizing the performance of digital cancellation. For this purpose, we 
quantify the amount of digital cancellation achieved when placing a digital canceller after each of the three 
analog cancelers analyzed in Figure 3(b). These results for digital cancellation are shown in Figure 3(c). 
We observe that, when digital cancellation is placed after analog cancellation, the amount of digital 
cancellation achieved after our proposed per subcarrier analog canceller is less than the amount of digital 
cancellation achieved after FFC1 and FFC2 cancelers. This behavior is due to the following result. 

Result 5 (Analog impacts Digital): As the amount of analog cancellation increases, amount of digital 
cancellation decreases. 

The reason for Result 5 is that as the amount of analog cancellation increases, the residual self- 
interference decreases, hence there is more noise in the estimation of the residual self-interference after 
analog cancellation and this results in less digital cancellation. In the limit, if analog cancellation can 
achieve infinite dB of cancellation (perfect cancellation), then digital cancellation becomes unnecessary 
and applying digital cancellation in this limit case will only lead to an increase in the noise. The reasoning 
for Result 5 was also noted in the simulation based analysis presented in [20] and the experiment based 
analysis of a narrowband single-antenna canceler presented in [21]. More details on wideband multiple 
antenna experiment results that demonstrate Result 5 can be found in [19]. 

E. Total Cancellation of Physical Layer Design 

We now compare the performance of our physical layer design, which uses per subcarrier analog 
cancellation, with the performance of cancellation designs that do not use per subcarrier analog cancella- 
tion. Specifically, we compare the results for total cancellation for systems which have the same passive 
suppression and digital cancellation mechanisms but use different analog cancelers. The different analog 
cancelers being the ones analyzed in Figure 3(b) (per subcarrier, FFC1, and FFC2). The results for total 
cancellation are show in Figure 3(d). We observe that using per subcarrier analog cancellation achieves 
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the largest total cancellation and the improvement is approximately 3 dB. We note that the advantage of 
our per subcarrier analog cancellation is, not only that it improves the total cancellation by 3 dB, but also 
that it achieves larger pre- ADC cancellation compared to the FFC1 and FFC2 systems. Predictably, we 
will show in Section V-B, larger per subcarrier analog cancellation results in larger rates than using FFC1 
or FFC2 cancelers. 

Next, we analyze the total cancellation of our design for the four different antenna configurations showed 
in Table I. Figure 3(e) shows a characterization of the total cancellation achieved when combining passive 
suppression with active per subcarrier analog and digital cancellation. We observe that A2 with device 
achieves the largest total cancellation. The cancellation values for A2 with device are between 70 dB and 
100 dB with a median of 85 dB. In general, we observe that for the same implementation of active analog 
and digital cancellation, the largest cancellation will be obtained with the configuration that achieves the 
largest passive suppression. This leads to an important direction that antenna design and placement are 
crucial for achieving practical full-duplex, and the design has to be cognizant of the device dimensions 
and placement. Finally, we observe that the performance of the cancellation scheme was very similar 
between the FD2xl and FDlxl systems. 

To the best of our knowledge, the levels of cancellation achieved by our A2 with device implementation 
are the best reported for a wideband 20 MHz multiple subcarrier and multiple antenna full-duplex system. 
The results provided in [5,6, 10, 11,21] correspond to narrowband systems. The results in [7] are for a 
multiple subcarrier system with 10 MHz bandwidth and correspond to a single interference antenna. The 
work in [7, 10, 11] does not report a measured value of total cancellation for a combination of passive, 
active analog, and active digital cancellation and focuses only on characterizing a subset of these types of 
cancellations. Finally, none of the previous works [5-7, 10, 11,21] report cancellations larger than 73 dB. 
Hence, we have the following result. 

Result 6: Our proposed self-interference canceller design, for 20MHz FDlxl and FD2xl systems, 
achieves total self-interference cancellation values similar or larger than what prior work has reported. 

Finally, we characterize the amount of residual self-interference to noise ratio. The residual self- 
interference is the amount of self-interference left after all the cancellation stages (passive, active analog, 
and active digital) have been applied. Figure 5 shows the residual self-interference to noise ratio (INR). As 
expected, configuration A2 with device results in the lowest levels of residual INR since this configuration 
is the one that achieves the largest cancellation. Although our self-interference canceller design can achieve 
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larger cancellation than what related work has reported, we observe from Figure 5 that these cancellation 
values are not enough to guarantee that the self-interference is reduced to the noise floor (INR=0 is not 
guaranteed). However, as we will show in Section V-B, there are conditions under which full-duplex can 
achieve higher rates than half-duplex even if the self-interference is not reduced to the noise floor. 

V. PHY Evaluation: Rate Performance 

A. Metric for PHY Rate Analysis: Empirical Ergodic Rates 

The ergodic rate is the fundamental measure of PHY layer capacity in fading channels [22] and is an 
upper bound on the throughput that would be achieved by any MAC protocol. The ergodic rates become 
the starting point for a system designer to choose actual constellation sizes and code rates. The ergodic 
rate (ER) for transmission to Node % is given by E (Ri) = E [log(l + SINRjfp])] where the expected value 
is computed as the average over all the packets p transmitted to Node % and SINRj [p] is the post processing 
Signal-to-self-Interference-plus-Noise-Ratio for packet p received at Node i. 

The empirical ergodic rate in experiments is computed based on an estimate of SINRj[p]. We estimate 
SINRj[p] from transmitted and received constellation symbols as follows. The constellation symbol Sj is 
sent to Node % via the wireless channel. Node % processes the received signal and computes Sj which is 
the estimate of s,. The average energy of the error or noise is given by E[|sj — Sj| 2 ]. Post processing 
SINR for packet p received at node i, SINRj[p], is computed as SINRjp] = E ^ L ._'^| 2 ] where the expected 
value is computed as the average over all the symbols transmitted to Node % during packet p. 

Since the two-way communication in half-duplex is achieved by time sharing the link with a fraction 
of time f3 dedicated for transmission from Node 1 and a fraction of time 1 — /3 dedicated for transmission 
from Node 2, the ergodic rate for each node in a half-duplex two-node communication system has to be 
scaled by their time of transmission, leading to E (i?f D ) = (3E (i?i) and E (R% D ) = (1 - (3)E (R 2 ). We 
performed only symmetric experiments where (3 = 0.5. For full-duplex transmissions, since both nodes 
transmit at the same time, the ergodic rate for each node in a full-duplex communication system is given 
by E (R[ D ) =E{R 1 ) and E (R( D ) = E (R 2 ). 

B. Comparison of Full-duplex and Half-duplex Ergodic Rates 

Previous work on full-duplex implementation [5,7, 11] had not considered the case of placing antennas 
around the full-duplex device. In Section IV, we demonstrated that placing the interfering antennas around 
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the full-duplex device can improve the total cancellation by 10 dB with respect to the case where antennas 
are not placed around a device. In this section, we show that the increase in total cancellation results in 
full-duplex rate gains at WiFi ranges. 

Experiment results in Figure 6 show the ergodic rate for transmission to a node in a two-way link as a 
function of the average received signal to noise ratio at a node (SNR). Experiment results in Figure 6(a) 
correspond to the case of antennas placed around the device (configurations Al and A2 with device in 
Table I). Experiment results in Figure 6(b) correspond to the case of antennas placed without the device 
in the middle (configurations Al and A2 without device in Table I). For each FD/HD system and antenna 
configuration, we ran two experiments for Links 1, 6, 7, 8, 9, and 10, and we ran one experiment for Links 
2, 3, 4, 5. Hence, for each FD/HD system and antenna configuration we ran 16 experiments (an experiment 
consisted in transmitting 90 packets from each of the two nodes in the link). Results in Figure 6 are per 
node per link per experiment per system. Hence, Figure 6(a) shows 64 markers for each system, which 
corresponds to 2 antenna configurations (Al without device and A2 without device) x 16 experiments x 
2 nodes. Similarly, Figure 6(b) shows 64 markers for each system. 

For each system depicted in Figure 6, we present experiment results with markers and also show a line 
which corresponds to a linear fit of the experiment results. The linear fit for the FD2xl, FDlxl, and 
HD2x2 systems was computed based on the data points that lied between 5 dB and 30 dB SNR. We only 
used this interval for the fit because most of the data points lie inside this interval and because this interval 
contains SNR values typical of WiFi operation (SNR < 30 dB). For the HD2xl and HD3xl systems 
the linear fit was computed based on the data points that lied between 5 dB and 23 dB. The reason we 
used this interval for the HD2x 1 and HD3x 1 linear fits is because we observe that for SNR values above 
23 dB the rate of HD2xl and HD3xl systems does not increase as the SNR increases. This leads us to 
conclude that our HD2xl and HD3xl implementations reach a performance ceiling at SNR ps 23 dB. 
We have observed that the performance ceiling of our implementation is a function of the bandwidth. 
When we use only one subcarrier (625 kHz bandwidth system) we do not observe a performance ceiling. 
We have not observed a performance ceiling for our FD or HD2x2 implementations. 

For the experiments results in Figure 6, we computed the SNR as the ratio of the average signal of 
interest RSSI at the node to average noise power. The average RSSI at a node was computed by averaging 
the measurements of received signal of interest power per packet over the 90 packets received during an 
experiment run. The average noise power was estimated to be —90 dBm for all experiments. The above 
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estimate of noise power was based on the radio data sheet and on node calibration performed before the 
experiments were started. The ergodic rate was computed using per packet measurements as was explained 
in Section V-A. 

We now analyze the rate performance of full-duplex and half-duplex systems that do not use antenna 
placement around the device. The rate results for these systems are shown in Figure 6(b). From Figure 6(b) 
we observe that for SNR < 30 dB, the experiment data points for the full-duplex systems are mostly 
below the experiment data points for half-duplex systems. Consequently, since the linear fits only consider 
experiment data points for which the SNR was lower than 30 dB, the linear fit for the FD2x 1 and FD1 x 1 
systems lies below the linear fit for the half-duplex systems. We observe that the experiment data points 
in Figure 6(b) show that FD2xl and FDlxl systems can have similar or large rates than half-duplex 
systems at SNR values larger than 30 dB. However, SNR values larger than 30 dB are not typical in WiFi 
systems. Consequently, from results in Figure 6(b) we conclude that full-duplex gains at WiFi ranges 
cannot be achieved with self-interference cancellation schemes that have a median cancellation of 78 dB 
or less, as is the case for the full-duplex systems without device considered in our experiments. 

We next analyze the performance of full-duplex and half-duplex systems that use antenna placement 
around the device. By comparing the experiment data and linear fits in Figure 6(a) with the experiment 
data and linear fits in Figure 6(b) we observe the following. Placing antennas around the device (device 
in the middle) improves the performance of full-duplex and half-duplex systems with respect to the case 
where antennas are not placed around the device. In other words, for the same SNR, full-duplex and 
half-duplex rates tend to be higher when the antennas are placed around the device. For the half-duplex 
systems this is explained by the fact that placing antennas around the device reduces the transmitter and 
receiver correlation and as these correlations decrease, the half-duplex rates increase [12,22]. For the 
full-duplex systems this is explained by the fact that placing the antennas around the device achieves 
larger cancellation than the configuration without a device in the middle of the antennas. 

The larger self-interference cancellation achieved by placing the antennas around the device yields the 
following result. 

Result 7: FDlxl and FD2xl systems with antenna placement around the device can consistently 
achieve larger rates than HD2x2, HD3xl, and HD2xl systems for SNR range of 20-30dB, thereby 
covering nearly half the range of a typical WiFi system. 

Result 7 can be verified from Figure 6(a). The linear fit for FD2xl and the linear fit for HD2x2 in 
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Figure 6(a) shows that FD2x 1 can achieve rates larger than HD2x2 for SNRs approximately higher than 
20 dB. Similarly, the linear fit for FD1 x 1 and the linear fit for HD3x 1 in Figure 6(a) show that FD1 x 1 
can achieve larger rates than HD3x 1 for SNRs approximately higher than 20 dB. 

From Figure 6(a) we note that, for the case where antennas are placed around a device, the full-duplex 
systems have a larger multiplexing gain than all half-duplex systems, where multiplexing gains are the 
slopes 2 of the rate-SNR curves in Figure 6. For the case of antennas without a device, from Figure 6(b) 
we observe that most of the full-duplex rates lie below 1.5 bps/Hz which explains the lower slopes of 
FD systems compared to HD systems when the antennas are placed without a device in the middle and 
when we consider SNRs lower than 30 dB for the linear fit. Thus, we obtain the following result. 

Result 8: FD1 x 1 and FD2 x 1 systems were measured to have a larger multiplexing gain per-node than 
HD2x2, HD3xl, and HD2xl systems for the case where the antennas are placed around the device 
(device in the middle). 

In Figure 6(a) the slope of the fit for the FDlxl system is approximately 1.3 times larger than the 
slope of the fit for the HD2x2 system and approximately 3.7 times larger than the slope of the fit for 
the HD3xl and HD2xl systems. The slope of the fit for the FD2xl system is approximately 1.5 times 
larger than the slopes of the fit for the HD2x2 system and approximately 4.2 times larger than the slopes 
of the HD3xl and HD2xl systems. 

FDlxl and FD2xl systems have a larger slope than HD3xl and HD2xl systems because of the 
following reason. At high SNR, the HD3xl and HD2xl systems can have a maximum multiplexing 
gain per-node of 0.5, since each node only transmits half the time. In contrast, at high SNR the full- 
duplex systems can have a maximum multiplexing gain per-node of 1, which is not scaled by 0.5, since 
full-duplex systems can transmit during the entire time slot. Thus, we conclude that FDlx 1 and FD2x 1 
systems are expected to have steeper slopes for rate increase vs. SNR compared to HD2x 1 and HD3x 1 
systems. We think that a reason why the fit for the FD systems achieves more than twice the slope of 
HD3xl and HD2xl systems is due to the performance ceiling of our implementation of HD3xl and 
HD2 x 1 which was explained earlier. 

As evident from the results in Figure6(a), FD1 x 1 and FD2x 1 rates can also have a larger slope than a 
HD2x2 system, which is a little surprising but we conjecture the following reason. As discussed above, 
the full-duplex systems can have a maximum multiplexing gain of 1. At high SNR, a 2x2 MIMO system 

2 If a system has multiplexing gain of r then at high SNR the rate in bps/Hz can be approximated as r log 2 SNR and the slope of increase, 
in a plot where the y axis is in bps/Hz and x axis is in dB, is equal to (rlog 2 SNR)/(10 * log 10 SNR) = 0.332r 
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can have a maximum system multiplexing gain of 2, which implies that HD2x2 can have a maximum 
multiplexing gain per-node of 1 (since each node transmits only half the time). As shown in [12], for 
a 2x2 system to achieve maximum multiplexing gain, the mean channel condition number should be 
approximately 3.5 dB or less (see Figure 6 in [12], where values of pr = Pr = lead to a multiplexing 
gain of 2 at high SNR and from equations (10) and (14) we compute that p T = p R = correspond to 
a condition number of 3.5 dB, computed as 101og 10 a/(4 — 0.65)/0.65). From both our results and the 
results in [12], it appears that indoor channels typically do not have condition numbers which meet the 
above criterion, therefore reducing achieved multiplexing gains of MIMO systems compared to theoretical 
maximum. Figure 7 shows the CDF of the condition number measured per packed from our experiment 
measurements. From Figure 7 we observe that the probability of having a condition number less than or 
equal to 3.5 dB is small (less than 10%) 

Next, we comment on the importance of per-subcarrier cancellation. The lower active analog cancellation 
achieved by FFC1 and FFC2 in a frequency selective environment (see Figure 3(b)) results in a degradation 
of the achievable rate performance as shown in Figure 8. More specifically, we observe from results in 
Figure 8 that at an SNR of 15 dB, schemes FFC1 and FFC2 result in approximately 13% performance 
loss compared to our proposed scheme. 

Finally we note that, although through-device cancellation is a simple and effective means of cancel- 
lation, antenna placement techniques in [5, 10, 11] may not be able to implement and take advantage of 
through-device cancellation. The reason is that antenna placement techniques in [5, 10, 11] are designed 
under the assumption that the self-interference channel does not have multi-path components. As we 
showed in Section IV-B, placing the antennas around the device makes the multi-path effect more severe. 
Consequently, the assumptions that should hold for antenna cancellations in [5, 10, 11] to be effective 
may not hold (or hold for fewer realistic conditions) when the antennas are placed around a device. 

VI. MAC Design 

In this section, we describe the design of the proposed MAC. The driving goal for full-duplex MAC 
design is minimal changes to the current WiFi standard, to support both half- and full-duplex nodes, and 
in the process, accelerate its adoption. We limited our attention to support full-duplex communication only 
between two nodes, i.e, when a mobile node and the AP have a packet for each other. With that in mind, 
we modified the standard half-duplex 802.11 Distributed Coordination Function (DCF) WiFi MAC [23] 
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with RTS/CTS to add a full-duplex mode. In the rest of the section, we describe the modifications to legacy 
WiFi MAC to support the full-duplex mode, while the complete details are provided in Appendix A. 

We divide our discussion on the changes to legacy WiFi MAC in three parts: (a) discovery and 
transmission of full-duplex packets, (b) management of ACKs and (c) behavior of overhearing nodes. 
Discovery and transmission of full-duplex packets: The first challenge is to get an opportunistic full- 
duplex data transmission between two nodes. This is achieved with the following changes from the existing 
standard 802.11 modules. We make use of the standard Request to Send (RTS) and Clear to Send (CTS) 
packets. The sender (also known as the primary node) signals intent to send a data packet using a 802.11 
RTS packet. The node receiving the RTS (secondary node) then discovers the transmit node id, which is 
needed to start the full-duplex transfer to the sender. Since the secondary node knows that the primary 
sender will be transmitting, it finds opportunity to send a full-duplex data as follows. 

In the standard 802.11 protocol, the RTS receiver sends a CTS frame and listens to the incoming 
data. However, in full-duplex MAC, we would also like the secondary node to transmit to the primary 
immediately after sending the CTS frame (after the standard SIFS time) whenever data is available. Note 
that at the head of the queue in the secondary node transmit buffer, there may not be a packet intended 
for the primary sender. We address this by making the secondary node inspect its queue and select the 
very first packet intended for the primary sender. If necessary, the secondary node further updates the 
Network Allocation Vector (NAV) based on the original NAV it received during the RTS frame and the 
length of secondary packet. 

In the standard 802.1 1 protocol, if a node receives a packet while transmitting, the data being transmitted 
is considered to have collided and it tries to retransmit the data. However, in our design both transmission 
and reception can occur at the same time, and hence a full-duplex exchange should not to be interpreted 
as a collision. Full-duplex MAC therefore checks if the received packet is from the other node involved 
in RTS-CTS exchange and accepts the packet if the NAV duration has not expired. 

Management of ACKs: The second challenge for the two nodes involved in full-duplex data transmis- 
sion is to send and receive acknowledgements for the successful transmissions. In legacy 802.11, after 
sending the data, a node expects an acknowledgement (ACK) frame. However in full-duplex, since data 
is sent from both nodes simultaneously, each node gets data before it gets an ACK. This is fixed by 
accepting one data packet in the NAV duration, and returning to the prior state where it is still waiting 
for ACK. However, on reception of a second data transmission before reception of ACK would result in 
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the node concluding that the data has collided. 

After the full-duplex transmission, both the nodes involved in transmission are waiting for an ACK. In 
legacy 802.11, nodes cannot transmit while waiting for ACK until the ACK timeout duration has passed. 
This would lead to neither of the two nodes receiving ACK (since the other node will not transmit) thus 
leading to the ACK getting timed out at both the nodes. In order to avoid this ACK timeout, full-duplex 
MAC allows the nodes to send an ACK even while waiting for ACK from the other end. Further, after 
sending the ACK, the node returns to the state where it is waiting for the ACK packet. 

For asymmetric data packet lengths, the node sending a smaller length packet has to wait a long time 
to receive the ACK because the other node with larger packet length has not finished sending data. Thus 
in full-duplex MAC design, the wait time for the reception of the ACK packet for both nodes involved 
in the transmission is re-adjusted to the end of the NAV duration if necessary. 

Behavior of overhearing nodes: The third challenge is to ensure that the node pair involved in full- 
duplex communication does not get an unfair advantage of capturing the channel repeatedly. In the standard 
802. 1 1 model, all nodes use a longer wait time (Extended Interframe space, EIFS) rather than the standard 
wait time (DCF Interframe Space, DIFS) if they receive an erroneous packet. The extended wait time is 
used to allow some other recipient for the packet who received the data correctly to be able to send an 
ACK frame in time. In full-duplex systems, all nodes that are not involved in full-duplex transmission will 
detect an erroneous packet and thus wait for EIFS. However, the nodes involved in full-duplex transmission 
do not find the reception erroneous and thus start their back-off timer after waiting for DIFS. Since the 
DIFS wait time is smaller than EIFS wait time, there is higher likelihood for one of the full-duplex nodes 
to grab the channel again causing unfairness. 

We propose two different strategies to solve this problem. The first is optimized for a system where all 
nodes are full-duplex. In full-duplex MAC design, full-duplex nodes that receive a CTS packet successfully 
that is not intended for them, ignore any erroneous packets during the NAV duration specified in CTS. 
Thus the nodes will not subsequently detect an erroneous packet caused by collision of two data streams 
and therefore not switch to using the EIFS mode. Note that the above change does not defeat the purpose 
of EIFS since the NAV time has already been understood by the node which includes the time taken for 
ACK transmission. 

However, note that in a system with both full-duplex and legacy half-duplex stations, as shown in 
the timing diagram in Figure 9, the half-duplex nodes wait for EIFS rather than DIFS after a full- 
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duplex transmission between AP and a full-duplex station in the system, leading to unfairness in the 
uplink throughput. To avoid unfairness, we propose a second strategy where full-duplex nodes do not 
ignore erroneous receptions during NAV except for the two nodes involved in full-duplex transmission to 
wait for EIFS after every full-duplex message exchange. Further, the two nodes involved in full-duplex 
transmission use a waiting time of EIFS rather than DIFS after receiving ACK. The change in waiting time 
will lead to increased wait times when there are a lot of full-duplex nodes while not compromising the 
uplink throughput of half-duplex stations significantly. We will analyze both strategies in Section VII-D. 

VII. MAC Evaluation 

In this section, we evaluate the full-duplex MAC protocol discussed in Section VI using a commercial 
software package, OPNET Modeler-Wireless. We started with the standard 802.11 codebase available 
in OPNET and made the necessary modifications to implement full-duplex MAC. Figure 9 shows a 
typical full-duplex framing structure where the RTS frame is followed by a SIFS time and a CTS frame. 
After another SIFS time, data transmission occurs simultaneously, and is then followed by a SIFS and 
an ACK. The following timings in /isec are shown from our evaluated 802.1 la-based full-duplex MAC 
implementation. For 18Mbps, DIFS=34/xs, RTS=36/zs, SIFS=16/is, CTS=32 / us, Data=704 / us, ACK=32 / us. 
Note that after a full-duplex data transmission, the ACKs from both the nodes involved in the full-duplex 
exchange are transmitted simultaneously. 

First, we study how goodput varies with different modulation formats and varying packet sizes for 
a system with one AP and one full-duplex node (STA). Next, we extend our understanding regarding 
goodput performance for a system with multiple full-duplex nodes. Further, we study a system with 
a mix of full-duplex and half-duplex nodes, where the half-duplex nodes ignore collisions during the 
NAV duration set by a RTS/CTS; this is non-standard behavior for legacy half-duplex nodes but serves 
to highlight the coexistence dynamics. Finally, we study the system with a mix of full and half-duplex 
nodes, where the half-duplex nodes are legacy nodes which do not ignore collisions in NAV duration. 

Table IV provides the simulation parameters used. In all cases, we used AP to STA (node) distance 
separation of 14 m. The free space path loss formula [24] was used to compute the amount of path loss. 
To model bit error rate, we assume that the self-interference and thermal noise are Gaussian, and we use 
the standard Q-functions for uncoded modulation [24]. The maximum packet size is 1500 bytes; there is 
no segmentation and each packet is sent as the payload of one full-duplex MAC frame. 
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A. Goodput characterization for different packet sizes 

We evaluate full-duplex MAC design for QPSK (18 Mbps) [25]. The rate value, 18 Mbps for QPSK, 
refers to the radio transmission rate. For a two-node system with one AP and one node STA, we compare 
a full-duplex system against a legacy half-duplex system with or without RTS/CTS. Total goodput is 
defined as the sum of the goodput from the AP to the STA and from the STA to the AP. For half-duplex 
QPSK-18 Mbps, the total MAC goodput for half-duplex system without RTS/CTS is 13.69 Mbps, total 
goodput for half-duplex system with RTS/CTS is 12.8 Mbps while for full-duplex the MAC goodput is 
25.62 Mbps for packet size of 1500 bytes. Thus, there is a 87% performance gain for full-duplex over 
half-duplex system without RTS/CTS, including all MAC overheads and 100% gain for full-duplex over 
half-duplex system with RTS/CTS. 

Result 9 (Gains for symmetric and asymmetric traffic): (a) Full-duplex MAC design doubles the through- 
put as compared to a legacy WiFi half-duplex with RTS/CTS, and increases by 87% as compared to legacy 
WiFi half-duplex system without RTS/CTS for symmetric traffic. 

(b) Full-duplex goodput is always higher than half-duplex goodput, where the half-duplex system may 
or may not use RTS/CTS. 

We now study the goodput performance when the traffic flow is asymmetric. Given that data traffic is 
predominantly based on TCP, an asymmetric TCP download-only traffic will have 1500 bytes from the 
AP to the STA (downlink) and 40 bytes of TCP ACK packets going from the STA to the AP (uplink). 
We model asymmetry by varying the size of the packets from the STA to the AP, while keeping the 
cancellation fixed at 85 dB. We note that it is well recognized that the performance gain of full-duplex 
is influenced by traffic asymmetry [26]. 

Table V shows that as we increase the uplink packet size from 40 bytes to 1500 bytes, full-duplex 
provides a goodput gain ranging from 1.3x (13.14 vs 10.07) to 2x (25.62 vs 12.8) as compared to half- 
duplex system using RTS/CTS. The full-duplex goodput is always higher than half-duplex goodput, where 
the half-duplex system may or may not use RTS/CTS. As the length of uplink data packet decreases, the 
uplink throughput decreases for a full-duplex system while the downlink throughput remains the same. 
The uplink throughput is a factor of the size of data from node to AP divided by size of data from 
AP to node less than downlink throughput. In other words, the throughput divided by size of data in 
each direction is constant. In half-duplex system, the time-resource is shared by the two nodes while in 
full-duplex system, the downlink transmission always happens and the uplink transmission can be viewed 
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as a bonus. 

We now compare the effect of varying self-interference cancellation in Figure 10 assuming symmetric 
1500 byte data in both directions. We note that as the self-interference cancellation improves, higher 
constellations can be supported for full-duplex system thus improving the rate. The rate for half-duplex 
system does not depend on the self-interference cancellation. In the rest of this paper, we will mainly 
illustrate the results on 18 Mbps QPSK constellation (1500 bytes packet size) noting that the improvement 
factors will be similar at a fixed self-interference cancellation value of say 82-83 dB. 

B. Goodput characterization for multiple full-duplex nodes 

In this section, we consider the effect of scaling the network size to indicate multiple nodes with the 
packet sizes for all nodes being 1500 bytes. 

Result 10 (Gains for full-duplex system with multiple nodes): The sum throughput for a full-duplex sys- 
tem with multiple nodes increases by a factor of approximately two when compared to a half-duplex system 
with RTS/CTS handshake. 

Consider the case when there are n full-duplex nodes and one AP. In a 802.11 system, each node has 
equal chance of winning the channel contention due to random backoff and thus each of the n + 1 node 
accesses the channel with equal probability. 

For comparison of full- and half-duplex throughput, we first ignore all timing and collision overheads 
and assume that all nodes have infinite queues that are always saturated and have data to be transmitted. 
Further, the destination of each new arriving packet at the AP is uniformly distributed across all nodes. In 
a half-duplex system, each node grabs the channel for a fraction l/(n + 1) time. When a node grabs the 
channel, it sends data to AP and thus, the fractional time-slots used for data transfer from node to AP is 
l/(n + 1). When AP grabs the channel, it sends the data at the head of the queue to the appropriate node. 
Since the head of the queue can be addressed to any node with equally probability, AP sends packet to a 
node for l/(n(n + 1)) of the time. The total throughput is normalized such that the sum throughput for 
the legacy half-duplex system is 1. 

For a full-duplex system, each node grabs the channel for a fraction l/(n+ 1) time. At each time, there 
are two concurrent communication paths ongoing, and thus the normalized throughput is 2. The uplink 
throughput from any node and downlink throughput to any node is 1/n which means that each node 
transmits to AP for 1/n fraction of the time-slots and vice versa. Thus, we see that full-duplex improves 
the uplink throughput from a node from l/(n+ 1) to 1/n which translates to an improvement factor of 
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1 + 1/n. The downlink throughput to node on the other hand improves from l/(n(n + 1)) to 1/n, an 
improvement by a factor of n + 1. Even though the total throughput improves by a factor of 2, for n > 1, 
the downlink throughput to a node improves by a larger multiple as compared to the uplink throughput 
from a node. The analysis here is summarized in the rows corresponding to full- and half-duplex systems 
in Table VI. 

The above analysis ignored several aspects, e.g. collisions and different packet sizes. The half-duplex 
system without RTS/CTS has collisions of data frames while full-duplex system has collisions of RTS 
frames. The results from OPNET simulations provide realistic throughputs incorporating all overheads as 
shown in Table VII. We report the sum goodput for the system together with average uplink/downlink 
goodputs. We also report collision statistics, for a system with RTS/CTS, (e.g. percentage of RTS requests 
that collided as a percentage of total RTS transmissions). Percentage RTS collisions is specifically the ratio 
of RTS that were not successfully received divided by total RTS packets sent including retransmissions 
multiplied by 100. Similarly, for half-duplex system without RTS/CTS, the data suffers collision and thus 
needs to be retransmitted. We provide the percentage of collided data frames which is the percentage of 
data packets not successfully received. 

The ratio of sum goodput for full-duplex to a half-duplex system should theoretically be a factor of 2. 
We enumerate the ratio of sum goodput for the following cases: 2 for n = 1, 2.03 for n = 2, 2.03 for 
n = 4, 2.02 for n = 8, when compared to half-duplex system with RTS/CTS. We note that the number of 
retransmissions for a full-duplex system are much lower compared to a half-duplex system with RTS/CTS, 
e.g. the number of RTS retransmissions reduce from 17.8% in the half-duplex case for n = 2 to 8.1% for 
the full-duplex case. 

We further observe that half-duplex system with RTS/CTS achieves better throughput than without 
RTS/CTS for n > 4. RTS/CTS incurs timing overheads that reduce the system throughput. However 
without RTS/CTS, a significant fraction of the time is wasted for recovering from the collisions of 
data frames which degrade throughput more than the collisions of RTS frames. Thus when n > 4, 
the collisions are high, and thus the time saved by having RTS frame collisions compared to having data 
frame collisions far outweigh the overhead of RTS/CTS signaling. Consequently, we use a half-duplex 
system with RTS/CTS as the baseline reference case for the rest of this paper. 

From the above discussion, the uplink throughput from a node is a factor n higher than the downlink 
throughput to a node for a half-duplex system. We can be observe in Table VII that factor n difference 
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holds approximately for the half-duplex system. We now compare the downlink throughput in full-duplex 
and half-duplex systems and note that theoretically, the full-duplex throughput should increase by a factor 
of Based on the results, the gain of downlink throughput from in a full-duplex system compared to a 
half-duplex system with RTS/CTS is 2, 1.52, 1.28 and 1.15 for n — 1, 2, 4 and 8 respectively as compared 
to the theoretical 2, 1.5, 1.25 and 1.13 respectively. For the uplink case, the throughputs of full-duplex 
should increase by a factor of (n + l)/n as compared to a half-duplex system. Our experimental results 
show improvement factor of 2, 3.08, 4.85 and 8 for n = 1, 2, 4 and 8 nodes respectively compared to the 
theoretical values of 2, 3, 5 and 9 respectively. Thus, we find that the simulation results match well with 
theory. 

C. Coexistence for full-duplex and modified half-duplex nodes that ignore collision during NAV 

We examine the performance when legacy half-duplex and full-duplex nodes coexist within the same 
wireless LAN. Suppose that there are n = 2m nodes comprising of m full-duplex and m half-duplex 
nodes. We first assume the case where the half-duplex nodes use a modified MAC design. The modified 
version provides an insight into the dynamics of co-existence, and also touches on an unfairness issue 
that we will highlight in the next subsection. Note that the modification may or may not be pragmatic in 
real scenarios. 

We assume that the modified half-duplex nodes ignore collisions during the NAV duration. Each node 
grabs the channel for a fraction l/(n + 1) time. When a half-duplex node grabs the channel, it sends 
data to AP. When a full-duplex node grabs the channel, AP also sends packets to full-duplex node. As 
a result, packets in the queue at AP for full-duplex nodes get depleted faster than the half-duplex nodes 
and does not have packets for full-duplex node with probability 1 resulting in the AP always transmitting 
to half-duplex nodes. Thus, the AP sends data to half-duplex nodes for fraction l/(m(n + 1)) time-slots. 
The half-duplex node sends data to AP for a fraction l/(n+ 1) time-slots. Further, full-duplex node sends 
data to AP and AP sends data to full-duplex node for a fraction l/(n + 1) time- slots. 

Thus, the downlink throughput to half-duplex node increases from 1/ (71(71+ 1)) in a legacy half-duplex 
system to l/(m(ra+l)) which is a factor of two improvement. The downlink throughput to full-duplex node 
increases from 1/(71(71 + 1)) in a legacy half-duplex system to l/(n + l) which is a factor n improvement. 
Further, the uplink throughput from half-duplex/full-duplex node remains the same as a legacy half-duplex 
system. The overall sum throughput is thus 1 +771/(71 + 1) factor more than the throughput of a half-duplex 
system. These results are summarized as Case 1 in the coexistence case in Table VI. 
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Result 11 (Coexistence with modified half -duplex nodes): In a mix of m full- and m half -duplex nodes, 
the total throughput increases compared to a half-duplex-only system by a factor of 1 + m/{2m + 1) 

In Table VIII, we see the goodputs for the case when there are m full-duplex and m half-duplex nodes. 
Case 1 represents the case where the modified half-duplex nodes avoid collisions during the NAV window. 
Note that the total throughput increases by a factor of 1.39, 1.42, and 1.45 for m — 1,2,3 respectively 
from the complete half-duplex system, as compared to 1.33, 1.4 and 1.43 as predicted from the theory 
above. 

Further, we note that the uplink throughput from any node to AP remains almost the same as the 
throughput from node to AP in a half-duplex node, which matches the above theoretical results. The 
downlink goodput to full-duplex node is the same as the uplink goodput from full-duplex node, which is 
also the same as the uplink goodput from a node in a legacy half-duplex system. Finally, the downlink 
throughput to half-duplex node should increase by a factor of 2 from the complete half-duplex system. 
We note that throughput improvement factor from the simulations is 1.8 for m — 1, 2.05 for m — 2, and 
2.5 for m = 4, comparable to the theoretical results. 

D. Coexistence for full-duplex and legacy half-duplex nodes 

In this subsection, consider legacy half-duplex nodes which will consider the full-duplex exchange as 
a collision and therefore would wait for EIFS rather than DIFS after every full-duplex transmission. The 
results for this scenario can be seen in Table VIII, Case 2. When the half-duplex nodes are legacy, we note 
that the full-duplex nodes wait for DIFS while the half-duplex nodes wait for EIFS after the end of every 
full-duplex transmission. The full-duplex nodes thus obtain an unfair advantage in accessing the channel. 
Thus, the throughput from the half-duplex node to the AP is reduced. Further, packets for full-duplex 
node are removed from the queue at the AP faster, and when AP successfully gets the channel, it has 
packets remaining only for half-duplex node. Therefore, the downlink to half-duplex and the goodput from 
AP to full-duplex node increase when the half-duplex nodes are legacy. Since full-duplex transmissions 
happen for a larger time-fraction, the overall sum goodput of the system increases. The above results 
were generated when the buffer at the AP is large. When the buffer at AP is limited, AP may not have 
packets for full-duplex node further limiting the throughput from AP to full-duplex node and thus the 
overall throughput reduces matching the performance closer to a half-duplex system. 

For the scenario with legacy half-duplex nodes, we propose a coexistence based MAC change for full- 
duplex nodes such that the uplink throughput of half-duplex node is not greatly impacted. The change for 
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the full-duplex nodes not involved in the full-duplex exchange is to wait for EIFS for full-duplex exchange 
like the legacy half-duplex nodes would be doing. The change for the full-duplex nodes involved in the 
full-duplex exchange is to wait for EIFS after every full-duplex data exchange like the legacy half-duplex 
nodes would be doing. The above change ensures that all nodes wait for DIFS before backoff after a 
half-duplex packet exchange, but wait for EIFS before backoff after a full-duplex packet exchange. As 
a result, the full-duplex nodes are more polite and do not get unfair advantage of repeatedly grabbing 
the channel. However, increased wait times after each full-duplex transmission is slightly disadvantageous 
when there are a lot of full-duplex nodes compared to legacy half-duplex nodes since the full-duplex 
nodes will wait longer (EIFS instead of DIFS). We note that the decrease in throughput due to extra wait 
time is not significant. The overall throughput for all full-duplex node decreases from 25.6 Mbps to 25.05 
Mbps for n = 8 and the throughput for 4 full-duplex and 4 half-duplex system reduce from the originally 
defined full-duplex MAC and slightly modified half-duplex MAC above from 18.37 Mbps to 17.12 Mbps 
for the "graceful" full-duplex system with legacy half-duplex system. Even though the overall throughput 
decreases, the extra wait time based system does not compromise the legacy half-duplex nodes' uplink 
throughput significantly. Thus, we can summarize our main result of this section as follows. 

Result 12 (Coexistence of Modified full-duplex and legacy half-duplex nodes): Overall system through- 
put decreases with extra wait time for full-duplex nodes, as compared to the coexistence between full-duplex 
and modified half-duplex nodes, but ensures minimal loss to legacy nodes' throughput. 

E. Extensions 

The full-duplex MAC can be extended to incorporate the following features. 

Beyond DCF: 802.11 modes, such as HCF and frame aggregation, may be useful to aggregate smaller 
packets to achieve increased symmetry between the two directions of full-duplex. 

Full-duplex Data Available: A CTS sender can indicate that it intends to send full-duplex data by 
re-purposing an unused bit (e.g. the More Data bit in the CTS Frame Control field [25] section 7.2.1). 
Another option is for a full-duplex node to use two MAC addresses, one for half-duplex use, and the 
other for full-duplex use. This allows the RTS sender to adapt its rate accordingly. The new duration can 
be set as a NAV duration by the CTS sender. 

Three-node full-duplex: The AP can receive from one node and transmit to another node as in [27]. 
The second node needs to get a sufficiently stronger signal from the AP. If the AP knows the path losses 
between nodes, it can select a suitable node pairs, and power control can be used for both transmissions. 
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No Tx/Rx switching: The duration, e.g. between DATA and ACK, could potentially be reduced since 
there is no time needed to switch from receive to transmit. 

Intelligent choice of full- duplex vs. half-duplex: The RTS and CTS packets offer an opportunity for the 
nodes A and B to estimate the signal quality which can allow for an informed choice of rate tuple and 
the full-duplex vs. half-duplex decision. 

VIII. Conclusions 

This paper presents the first design for a full duplex multi-antenna 20 MHz WiFi-ready design. We 
achieved the best self-interference cancellation reported till date, and presented an integrated PHY and 
MAC design that is compatible with IEEE 802.1 lx, enabling accelerated adoption of FD wireless. Our 
design achieves high rate and extended range, adequate for most indoor WiFi deployments. We believe that 
our results conclusively show that full-duplex WiFi is possible and can be highly beneficial in practical 
propagation environments. 
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Appendix 

In this section, we describe a detailed sequence of steps for a full-duplex data exchange between an 
AP and a STA as per our design. The corresponding states (denoted by using {}) are shown in the flow 
chart in Figure 1 1 . 

1) An idle STA {Tl} gets a packet from its higher layer which it needs to send to AP {Tl}. 

2) STA may need to wait {T2} if it has not waited already. STA listens on the desired channel. 

3) If channel is idle (i.e. no active transmitters), STA sends a RTS frame {T3} to AP with the duration 
field specified in the network allocation vector (NAV) based on the frame length. After sending RTS {T4}, 
STA waits for a CTS response {T5} 

4) Assuming correct transmission, AP goes from {Tl} to {Rl} to {R2} to {R3} while receiving the 
RTS. The AP gets the RTS {R4} and needs to send a CTS. If it has a packet to send to STA, full-duplex 
mode will be used. The scheduler in AP searches for a packet destined for STA and ensures that it is 
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at head of the transmit queue for transmission {T15}. It prepares the CTS {T16} and waits for a Short 
Inter Frame Space (SIFS) period {T2}, and then sends a CTS frame {T2}. 

5) Any third node that overhears the CTS remains silent and ignores all collisions on the channel till the 
end of NAV duration in the CTS. 

6) STA starts receiving the CTS, goes through {T5-R1-R2-R3-R7} to get the CTS, and then prepares to 
send the data frame {T6}, waits for SIFS {T3} and then sends the primary FD data frame FDDATA1 
{T3}. 

7) If the AP has data to send, after sending CTS {T17}, it prepares the full-duplex data frame {T18}, 
waits for SIFS {T2} and then transmits the secondary FD data frame FDDATA2 {T3}. 

8) Both data frames FDDATA1 and FDDATA2 travel in different directions at the same time. Both nodes 
wait for ACK after sending data {T8}. Both nodes receive the data {R5} and need to respond with an 
ACK. Since they were expecting an ACK, they set an FDACK flag to remember to come back and wait 
for the ACK. Then they prepare the ACK {T13}, wait for SIFS time {T2} after any data transmission has 
finished, and send the ACK {T3-T14} and come back to wait for their expected ACK {T8}. The ACK 
timeout is adjusted to the end of the NAV to allow the other node to finish transmitting a data frame that 
may be longer. 

9) At each node, when the ACK is received as expected {R1-R2-R3-R6}, the node waits for DIFS and 
for a random backoff period {T2} before contending for the channel for the next transmission. 
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Fig. 1. Block diagram of a full-duplex OFDM node with two transmitter antennas and one receiver antenna (2x1) using passive suppression 
and active analog self-interference cancellation. Blocks used for active analog cancellation are highlighted in gray. Passive suppression consists 
in propagation loss through hi,\,\ and hi,2,i. The Tx Radios are radio chains that up-convert from baseband (BB) to RF. The Rx radios are 
radio chains that down-convert from RF to BB. 



Antenna Placement 1 (Al) Antenna Placement 2 (A2) 
T2 is orthogonal to T1/R2 T2 is parallel to T1/R2 
R1/T3 is parallel to Tl R1/T3 is orthogonal to Tl 



T2 R1/T3 




TABLE I 

Different antenna configurations with distance between parallel antennas of 37.5 cm 
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Link 


Node 


Physical 


Type of 


Walls 


number 


pair 


Distance (m) 


channel 


crossed 


i 
i 




D.J 


T OS 


n 
w 


2 


Nd, Ne 


4.4 


NLOS 


1 


3 


Nb, Ne 


9.3 


NLOS 


1 


4 


Nc, Ne 


11.3 


NLOS 


1 


5 


Na, Ne 


14.8 


NLOS 


1 


6 


Nb, Nc 


6.5 


NLOS 


2 


7 


Nb, Nd 


8.3 


NLOS 


2 


8 


Na, Nc 


4.7 


NLOS 


3 


9 


Nc, Nd 


8.2 


NLOS 


3 


10 


Na, Nd 


12.7 


NLOS 


3 



TABLE II 

Links considered in experiments 





Number of 


Number of 


Number of 


Total number 


System 


antennas 


up-converting radio 


down-converting radio 


of radio chains 




per node 


chains per node 


chains per node 


per node 


FD 2x1 


3 


3 


1 


4 


HD 3x1 


3 


3 


1 


4 


HD 2x2 


2 


2 


2 


4 


FD lxl 


2 


2 


1 


3 


HD 2x1 


2 


2 


1 


3 



TABLE III 

Number of antennas and radios per node used by the different full-duplex and half-duplex systems that were 

evaluated via experiments. 
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-Al No Device FD lxl 

■ Al No Device FD 2x1 

■ Al With Device FD lxl 

■ Al With Device FD 2x1 
A2 No Device FD lxl 
A2 No Device FD 2x1 
A2 With Device FD lxl 
A2 With Device FD 2x1 



0.6 



Q 

U 



0.4 




■ A2 With Device FD 2x 1 Proposed Canceller 

- A2 With Device FD 2x1 FFC1 Canceller 
A2 With Device FD 2x1 FFC2 Canceller 

■Al No Device FD 2x1 Proposed Canceller 

- Al No Device FD 2x1 FFC1 Canceller 
Al No Device FD 2x1 FFC2 Canceller 



60 70 
Cancellation (dB) 



90 



15 20 25 

Cancellation (dB) 



30 



(a) Passive Cancellation for four different antenna place- 
ments 



(b) Analog cancellation for different analog cancelers. The 
results are shown for two different placements. 




A2 With Device FD 2x1 Proposed Canceller 
A2 With Device FD 2x1 FFC1 Canceller 
A2 With Device FD 2x1 FFC2 Canceller 
Al No Device FD 2x1 Proposed Canceller 
Al No Device FD 2x1 FFC1 Canceller 
Al No Device FD 2x1 FFC2 Canceller 



5 10 
Cancellation (dB) 



75 80 85 

Cancellation (dB) 



100 



(c) Digital Cancellation when the same digital canceler (d) Total cancellation for systems which have the same passive 
is applied after different analog cancelers. The results are suppression and digital cancellation mechanisms but different 
shown for two different placements. analog cancelers. The results are shown for two different 

placements. 




Cancellation (dB) 



(e) Total Cancellation for different antenna placements and 
using our proposed per subcarrier analog canceler and 
digital canceler. 



Fig. 3. CDF of the amount of cancellation for different cancelers and different antenna placements. 
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P2P Cancellation Coefficient Magnitude (dB) 



(a) Cancellation coefficient per subcarrier captured for two (b) CDF of the peak-to-peak value of the cancellation 
subsequent packets. coefficient magnitude. 

Fig. 4. Characterization of the effect of passive cancellation on the frequency response of the self-interference channel. 




Al No Device FD lxl 
Al No Device FD 2x1 
Al With Device FD lxl 
Al With Device FD 2x1 
A2 No Device FD lxl 
A2 No Device FD 2x1 
A2 With Device FD lxl 
A2 With Device FD2xl 
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Fig. 5. Residual self-interference INR (INR after all cancellation stages). 




(a) Results for antennas placed around the device (device in the 
middle) 
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(b) Results for antennas without device in the middle. 



Fig. 6. Ergodic rate as a function of the SNR and and linear fit approximations. 





Fig. 8. Ergodic rate vs. SNR performance for different analog cancelers 
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Fig. 9. Full-duplex MAC frames for 802.11a 



Parameter 


Value 


# Nodes 


2 (AP+1) to 9 (AP+8) 


Transmit power 


9 dBm 


AP to node distance 


14m 


Center frequency 


2400 MHz 


Free space path loss 


63 dB 


RSSI 


-59 dBm 


Self-interf. cancellation 


Default 85 dB 


Modulation 


QPSK, 18 Mbps 


Data Asymmetry 


5% to 100% 


Packet size 


40 to 1500 bytes uplink, 1500 bytes downlink 


BER model 


Q-function, treating self-interference as noise 


Traffic model 


Full buffer 


AP Queue 


Single queue 


Buffer size 


25600 KB 


Simulation time 


10 sec 



TABLE IV 

MAC SIMULATION PARAMETER SET 




HD 18Mbps QPSK 
HD 36Mbps 16-QAM 
■■" HD 54Mbps 64-QAM 
•-FD 18Mbps QPSK 
■♦-FD 36Mbps 16-QAM 
FD 54Mbps 64-QAM 



72 74 76 78 80 82 84 86 
Self-Interference Cancellation (dB) 



88 



Fig. 10. Performance of different modulations and constellations vs. self-interference cancellation, where half-duplex system does not use 
RTS/CTS signalling. 
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AP-Node 
(bytes) 


Node-AP 
(bytes) 


HD w/o RTS 

(Mbps) 


HD with RTS 
(Mbps) 


FD 

(Mbps) 


Gain 

(HD w/o rts) 


Gain 

(HD with rts) 


1500 


1500 


6.92+6.77 


6.41+6.39 


12.81+12.81 


1.87 


2.00 


1500 


1000 


7.85+5.26 


7.21+4.92 


12.80+8.54 


1.62 


1.76 


1500 


500 


9.12+3.05 


8.38+2.82 


12.80+4.27 


1.40 


1.52 


1500 


40 


10.83+0.3 


9.81+0.26 


12.80+0.34 


1.18 


1.30 



TABLE V 

GOODPUT FOR VARYING PACKET SIZES, AP + 1 STA, 85 DB CANCELLATION, THE GOODPUT NUMBERS ARE WRITTEN AS GOODPUT 
FROM AP TO STATION (DOWNLINK) + GOODPUT FROM STATION TO AP (UPLINK). 





Sum Goodput 


Average Goodput per station (Mbps) 




(Mbps) 


AP to HD STA 
(HD downlink) 


AP to FD STA/FD STA to AP 
(FD downlink=uplink) 


HD STA to AP 
(HD uplink) 


2m FD 
2m HD 
m FD, m HD case 1 


2 
1 

1+ 

n+1 


i 

n(n+l) 
m(n+l) 


l 

n 
1 

n+1 


i 

n+1 
1 

n+1 



TABLE VI 

Theoretical normalized goodput for different scenarios, where number of nodes n = 2m and Case 1 is the case 

WITH MODIFIED HALF-DUPLEX NODES. 





Sum Goodput 


Average Goodput (Mbps) 


% RTS/Data 




(Mbps) 


AP to STA 
(downlink) 


STA to AP 
(uplink) 


Collisions 


n = 1 


FD 


25.62 


12.81 


12.81 


10.8 


HD with RTS 


12.8 


6.41 


6.39 


11.1 


HD without RTS 


13.69 


6.92 


6.77 


10.7 


n = 2 


FD 


26.02 


6.52 


6.52 


8.1 


HD with RTS 


12.82 


2.12 


4.29 


17.8 


HD without RTS 


13.25 


2.13 


4.49 


17.8 


n = 4 


FD 


25.99 


3.25 


3.25 


18.2 


HD with RTS 


12.78 


0.67 


2.53 


26.9 


HD without RTS 


12.57 


0.63 


2.49 


26.7 


n = 8 


FD 


25.6 


1.6 


1.6 


32.3 


HD with RTS 


12.7 


0.2 


1.39 


40.0 


HD without RTS 


11.8 


0.16 


1.31 


35.8 



TABLE VII 
GOODPUTS WITH MULTIPLE NODES. 
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Sum Goodput 


Average Goodput (Mbps) 




(Mbps) 


AP to HD STA 


AP to FD STA/FD STA to AP 


HD STA to AP 






(HD downlink) 


(FD downlink=uplink) 


(HD uplink) 


m—1 


2m FD 


26.07 




6.52 




2m HD 


12.82 


2.12 




4.29 


m FD, m HD case 1 


17.87 


3.81 


4.81 


4.45 


m FD, m HD case 2 


18.64 


4.79 


5.65 


2.55 


m = 2 


2m FD 


25.99 




3.25 




2m HD 


12.78 


0.67 




2.53 


m FD, m HD case 1 


18.15 


1.37 


2.59 


2.54 


m FD, m HD case 2 


20.59 


1.67 


3.78 


1.07 


m = 4 


2m FD 


25.6 




1.6 




2m HD 


12.7 


0.2 




1.39 


m FD, m HD case 1 


18.37 


0.5 


1.38 


1.32 


m FD, m HD case 2 


21.86 


0.59 


2.23 


0.42 



TABLE VIII 

GOODPUTS WITH COEXISTENCE FOR MULTIPLE NODES WHERE IN CASE 1, HD NODES IGNORE COLLISIONS IN NAV WHILE IN CASE 2, 

HD NODES ARE LEGACY AND DO NOT IGNORE COLLISIONS. 
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Fig. 11. 



FD-MAC flowchart 



